Proteins with greater influence on network dynamics 
evolve more slowly but are not more essential 
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A fundamental question for evolutionary biology is why rates of evolution vary dramatically be- 
tween proteins. Perhaps surprisingly, it is controversial how much a protein's functional importance 
affects its rate of evolution. In most studies, functional importance has been measured on the coarse 
scale of protein knock-outs, while evolutionary rate has been measured on the fine scale of amino 
acid substitutions. Here we introduce dynamical influence, a finer measure of protein functional 
importance. To measure dynamical influence, we first use detailed biochemical models of particular 
reaction networks to measure the influence of each reaction rate constant on network dynamics. We 
then define the dynamical influence of a protein to be the average influence of the rate constants for 
all reactions it is involved in. 

Using models of a dozen biochemical systems and sequence data from vertebrates, we show that 
dynamical influence and evolutionary rate are negatively correlated; proteins with greater dynamical 
influence evolve more slowly. We also show that proteins with greater dynamical influence are not 
more likely to be essential. This suggests that there are many cellular reactions whose presence is 
essential for life, but whose quantitative rate is relatively unimportant to fitness. We also provide 
evidence that the effect of dynamical influence on evolutionary rate is independent of protein ex- 
pression level, expression specificity, gene compactness, and reaction degree. Dynamical influence 
offers a finer view of functional importance, and our results suggest that focusing on essentiality 
may have previously led to an underestimation of the role function plays in protein evolution. 



Introduction 

Every protein evolves at a characteristic rate p], and 
these rates vary dramatically from protein to protein. 
Understanding this variation is a fundamental challenge 
for evolutionary biology. A predictive theory for evolu- 
tionary rates might also guide drug design toward targets 
that evolve slowly, lessening the problem of evolved drug 
resistance in pathogens. 

The recent revolution in genomics has enabled many 
factors to be examined for correlation with protein evo- 
lutionary rate; among them are expression level [2], gene 
compactness [3], knock-out essentiality [3], and interac- 
tion degree (F. Perhaps surprisingly, it is controver- 
sial how much a protein's functional importance impacts 
its rate of evolution [5] . Theoretical considerations sug- 
gest that functional importance should play a substan- 
tial role [7]. Studies in bacteria [8], yeast [4J, and mam- 
mals [3J have shown that the essentiality of a gene, i.e. 
the fitness effect of gene deletion, has only a weak (but 
statistically significant) correlation with its evolutionary 
rate. 

Protein evolutionary rate is most commonly by the ra- 
tio dN/dS of the rate of nonsynonymous (amino acid 
changing, dN) to synonymous (amino acid preserving, 
dS) genetic mutation. Protein importance is most com- 
monly assessed by the fitness effect of knocking that pro- 
tein out. Individual substitutions may, however, produce 
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only small changes in a protein's structure and, conse- 
quently, small changes in its functional efficacy. It is thus 
not clear that the fitness effect of a protein knock-out 
will correlate well with the fitness effect of small changes 
in protein efficacy, and this may explain the weak cor- 
relation seen between protein importance measured by 
knock-out and evolutionary rate. 

Here we take advantage of another recent revolution 
in biology, the rise of mathematical modeling, to take 
an alternative and much finer view of a protein's func- 
tional importance. We assess how small changes in the 
efficacy of a protein's reactions impact the dynamics of 
the network in which the protein is embedded. To do 
so, we use computational models of specific molecular 
networks. Each of these models condenses the results of 
many biochemical studies, and we use them to probe the 
fine scale influence of variation in reaction rate constants 
on the network dynamics [9] . We construct a measure of 
a protein's influence on network dynamics from the influ- 
ence of each constant for each reaction in which it is in- 
volved, and we correlate this with evolutionary rate. The 
models we consider were all constructed in mice, rats, or 
humans, but we expect conservation among vertebrates 
of network structure [TU] and dynamics, allowing us to 
assess evolutionary rate using more species. 

We find that proteins with greater dynamical influence 
evolve more slowly. Moreover, we find that a protein's 
dynamical influence is uncorrelated with its essentiality 
as measured by knock-out phenotype. We do find that 
in our data that evolutionary rate is also negatively cor- 
related with essentiality, but this correlation is typically 
weaker than with dynamical influence. We also examine 
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FIG. 1: Species tree used for measuring evolutionary rate. 
This tree is that obtained from concatenating all sequences 
studied, and the scale bar is in synonymous substitutions. 

the correlation of dynamical influence with previously es- 
tablished predictors of protein evolutionary rate. In par- 
ticular, we find that although dynamical influence and 
expression level are positively correlated, they appear to 
exert independent effects on evolutionary rate. 

The advent of systems biology has led to much finer 
knowledge of the role played by specific proteins in their 
network contexts. Here we have leveraged this knowl- 
edge to measure protein functional importance by the 
influence of that protein on the dynamics of its network. 
We find that the dynamical influence of a protein is neg- 
atively correlated with its evolutionary rate and uncorre- 
cted with its knock-out essentiality. These observations 
offer a finer perspective on the role of functional impor- 
tance in the protein evolution. 



Methods and Materials 
Data and statistics 

Dynamical biochemical models encoded in the Sys- 
tems Biology Markup Language [IT] and formulated as 
systems of ordinary differential equations were obtained 
from the 14th release of the BioModels database [12] . We 
considered all deterministic models which involved 8 or 
more distinct proteins, as annotated by their UniProt [T3] 
identifiers, yielding 10 models. One additional model [T4"] 
was obtained from the Molecular Systems Biology struc- 
tured data archive, and another |15j was converted to 
SBML from published Matlab scripts. 

Protein sequence alignments were obtained from the 
Homologene database |16j . after mapping UniProt iden- 
tifiers to UniGene identifiers. Corresponding DNA se- 
quences were obtained from GenBank. 

PAML [T7] was used to obtain a maximum-likelihood 
estimate of dN/dS for each protein over the tree shown 
in Figure [T] Only proteins for which 3 or more sequences 
were available were considered. 

Following [3 , proteins were identified as essential or 
non-essential based on targeted deletion experiments 
compiled in the Mouse Genome Database (http:// 
[www . inf ormat ics . j ax . org ) . A protein was deemed 
essential if it resulted in a phenotype of "lethality- 
prenatal /perinatal" or "lethality-postnatal", including 
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FIG. 2: (a) Reactions indicated with bold arrows in this ex- 
ample network are considered when calculating the dynamical 
influence of protein A. Note also that the external protein lig- 
and L would not be considered in our analysis, (b) Dynamical 
trajectories are shown for the active forms of proteins A and 
C. The bold lines are trajectories given the basal rate con- 
stants, while the dark and light shaded regions indicate the 
change in dynamics upon increasing k^ or fcj, respectively. 
In this case fcj" has a larger effect than fej, so the dynamical 
influence of is greater than the influence ft J of fcj . 

more specific classifications. We expect this essential 
vs non-essential characterization to be well-conserved 
among vertebrates. 

Mouse mRNA average expression and tissue specificity 
data were obtained from [3J. 

Correlations are assessed using Pearson's linear corre- 
lation coefficient. Reported p- values are from one-sided 
permutation tests. They correspond to the probability 
by chance of the correlation having the expected sign 
and being of the observed magnitude or larger. 



Dynamical influence 

To quantify the change in network dynamics as the 
reaction rate constants k vary from their published values 
k*, we introduce the sensitivity measure Kf 




dt. (1) 



conditions c 



This measure sums over all molecular species y in the 
model, and we sum over all conditions c considered in the 
original model publication. Most commonly, the condi- 
tions considered are different levels of stimulation by an 
external ligand. The normalization a y is taken to be the 
maximum of y over all conditions considered. See Fig- 
ure [2jb) for an illustration of how varying rate constants 
changes dynamics. The Atj were calculated using Sloppy- 
Cell (http://sloppycell.sourceforge.net), and the 
scripts used are provided in supplementary data. 
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We define the dynamical influence of a protein to be 
the geometric mean of the influence of all rate constants 
for all reactions it is involved in: 



Insulin/EGF signaling 



Heregulin/EGF signaling 



D=(k 



* / gcom ' 



(2) 



The Ki are only considered for reactions in which the 
protein participates by itself, not in a complex. See Fig- 
ure]^ a) for an example. Several models consider pertur- 
bations by externally manipulated protein ligands; these 
ligands were excluded from consideration, as the manip- 
ulations considered in the models may not be physiolog- 
ical. Additionally, the TNF receptor was excluded from 
our analysis of the apoptosis model |15j , because it was 
modeled non-mechanistically. 



Results 

Proteins with greater dynamical influence evolve 
more slowly 

To measure the influence of a particular protein on 
network dynamics, we first consider the influence of each 
rate constant of each reaction on network dynamics. The 
influence Ki of rate constant i is defined as the summed 
and integrated change in all molecular species trajecto- 
ries as that rate constant is changed (Equation [T| . See 
Figure [2jb) for an illustration of how changes in rate 
constants are reflected in changes in dynamics. We sum 
influence over all molecular species because it is not clear 
which aspects of each network's dynamics are relevant to 
an animal's fitness. Results are, however, qualitatively 
similar if we consider changes in only a select subset of 
"output" molecular species (see online supporting data) . 
Given the influence of each rate constant, the influence 
D of a protein is then defined as the geometrical mean 
of the influence of all rate constants for all reactions it is 
involved in (Figure^ a)). 

We measure the evolutionary rate of each protein via 
the ratio of the rate of nonsynonymous (amino acid 
changing, dN) to synonymous (amino acid preserving, 
dS) genetic mutation, as inferred over the tree seen in 
Figure [T| (Results are qualitatively similar if we consider 
only the tree of mammals, rather than vertebrates (see 
supporting data).) 

Figure [3] plots, for each system, protein evolutionary 
rate dN/dS versus protein influence D on network dy- 
namics. We consider each system separately, because 
it is not clear that changes in the dynamics of, for ex- 
ample, arachidonic acid metabolism and ErbB signaling 
would yield comparable fitness effects. Table [i] summa- 
rizes these results. 

We expect that proteins with greater influence on net- 
work dynamics are functionally more important and thus 
should evolve more slowly. For 11 of the 12 systems 
studied, there is indeed a negative correlation between 
protein evolutionary rate and dynamical influence (Ta- 
ble [I]). The single exception is the /3-adrenergic signal- 
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FIG. 3: Relationship between protein evolutionary rate and 
dynamical influence. Each point represents a particular pro- 
tein in each system. Proteins deemed essential in mouse 
knock-out studies are plotted in black, non-essential in grey. 
There was no essentiality data available for proteins plotted 
in white. 



ing model, which may be unreliable because it is a well- 
mixed approximation to a spatial model that was actu- 
ally fit to data. (Technical limitations presently preclude 
us from considering the partial-differential-equation spa- 
tial model in this analysis.) Individual p-values are 
not dramatic; each plot includes few points because 
building such detailed dynamic models is currently low- 
throughput, and the models are thus relatively small. If 
all the systems were completely independent, the odds 
of 11 of 12 showing a negative correlation by chance are 
12/2 12 , which is less than 0.3%. The first 7 systems in Ta- 
ble [I] do share some proteins between them, because the 
EGF signaling pathway is the most often-modeled system 
in higher eukaryotes. If we eliminate all proteins shared 
between two or more systems, then 10/11 show a neg- 
ative correlation, expected by chance at the 0.5% level. 
(The smaller NGF/EGF signaling model is eliminated in 
this comparison because all save one of the proteins in it 
are shared.) See supporting data for details. 

The ratio dN/dS is affected by both changes in the 
substitution rate dN and in the neutral mutation rate 
dS. In the supporting data, we show that the correlation 
between dN/dS and D we observe here is not due to 
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system r dN/dStD (p-val, N) r dN / dS}E (p-val, N) rp, g (p-val, N) r dN/dSiD \ E 



Insulin/EGF signaling [14] 


-0.35 (0.12, 13) 


-0.11 (0.37, 12) 


-0.14 (0.36, 12) 


-0.37 


Heregulin/EGF signaling [TH] 


-0.27 (0.18, 14) 


+0.20 (0.74, 13) 


-0.43 (0.08, 13) 


-0.24 


NGF/EGF signaling (smaller) [19] 


-0.65 (0.05, 8) 


-0.22 (0.43, 7) 


+0.05 (0.43, 7) 


-0.47 


NGF/EGF signaling (larger) [20] 


-0.43 (0.04, 17) 


-0.16 (0.30, 15) 


+0.17 (0.70, 15) 


-0.64 


EGF receptor endocytosis [21] 


-0.10 (0.32, 25) 


-0.44 (0.04, 17) 


-0.36 (0.08, 17) 


-0.44 


Wnt/ERK crosstalk [21] 


-0.32 (0.18, 10) 


-0.98 (0.14, 7) 


+0.46 (1.00, 7) 


+0.65 


IL6 signaling [23] 


-0.28 (0.24, 10) 


-0.08 (0.34, 9) 


+0.18 (0.56, 9) 


-0.08 


Gl cell cycle progression [24 


-0.31 (0.20, 10) 


-0.26 (0.31, 7) 


-0.28 (0.25, 7) 


-0.56 


Rho-kinase activation [25] 


-0.21 (0.24, 16) 


-0.04 (0.45, 12) 


+0.26 (0.75, 12) 


-0.23 


Arachidonic acid metabolism [26] 


-0.53 (0.06, 10) 


-0.36 (0.29, 7) 


+0.98 (1.00, 7) 


+0.67 


TNF-driven apoptosis [15] 


-0.14 (0.36, 12) 


+0.09 (0.60, 9) 


+0.11 (0.61, 9) 


+0.08 


/3-adrenergic signaling [27 


+0.39 (0.87, 10) 


-0.40 (0.24, 7) 


-0.21 (0.34, 7) 


+0.31 


expected sign 






+ 




total of expected sign 


11 


10 


7 


8 



TABLE I: Correlations between evolutionary rate dN/dS, dynamical influence D, and essentiality E. p- values are from one-sided 
permutation tests, and N is the number of proteins in each correlation. 



correlation with dS. 



Proteins with greater dynamical influence are not 
more essential 

The most common measure of a protein's functional 
importance is essentiality, whether or not an organism 
can survive without it. As in [3], we assessed essentiality 
by whether targeted knock-out of a protein yielded lethal- 
ity in mice. Table |T] shows that 10 of 12 systems show 
the expected negative correlation between essentiality E 
and evolutionary rate, and that the correlations are typ- 
ically weaker than that between dynamical influence and 
evolutionary rate. 

Perhaps surprisingly, we find little correlation between 
dynamical influence and essentiality. The correlation is 
only positive for only 7 of 12 models (p- value 0.39). This 
suggests that the correlation we observe between D and 
dN/dS is independent of the correlation between E and 
dN/dS. As seen in Table [I] the partial correlations be- 
tween evolutionary rate and dynamical influence typi- 
cally remain negative when controlling for essentiality. 
Of the 11 systems with a negative correlation between 
dN/dS and D, three do change sign when controlling for 
essentiality. It is difficult to assess the significance of 
these sign flips, however, because in two of these three 
cases only one protein was essential or nonessential. 



Other potential correlates 

The most well-established correlate with protein evo- 
lutionary rate is level of expression. Table |TT| reports 
correlations in our data between evolutionary rate and 
expression-related variables. 



To measure expression level, we used the average ex- 
pression level across 61 mouse tissues, as compiled by [3]. 
As found in other studies [2], evolutionary rate is consis- 
tently negatively correlated with expression level in our 
data (Table [TTJ) . Moreover, dynamical influence and ex- 
pression level are positively correlated in 8 of 12 cases. Of 
the 11 systems for which evolutionary rate and dynam- 
ical influence are negatively correlated, in only 2 cases 
does controlling for expression level via partial correla- 
tion change the sign. This suggests that dynamical influ- 
ence and expression level, although correlated with each 
other, exert mostly independent effects on evolutionary 
rate. 

In vertebrates, expression level may vary from tissue 
to tissue, so we also we consider data on tissue speci- 
ficity r compiled by [3], Tissue specificity is consistently 
positively correlated with evolutionary rate (Table [LT| ; 
proteins expressed in few tissues evolve faster. Dynami- 
cal influence and specificity are negatively correlated in 
9 of 12 cases, indicating that influential proteins tend 
to be expressed in many tissues. Controlling for tissue 
specificity, dynamical influence and evolutionary rate re- 
main negatively correlated for 10 of the 11 systems for 
which they began negatively correlated, suggesting that 
these factors independently influence the rate of protein 
evolution. 

We have also examined correlations with gene com- 
pactness [3] and the number of reactions a protein par- 
ticipates in (see supporting data). We find no consistent 
correlation between a protein's dynamical influence and 
it's genetic compactness. We do find that there is a neg- 
ative correlation between a protein's dynamical influence 
and the number of reactions it participates in. In both 
cases, however, it appears that dynamical influence ex- 
erts an independent effect on evolutionary rate. 
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system 


r dN/dS,x (p-val, N) r D ,x (p-val, N) 


r dN/dS,D\X 


r dN/dS,T (P- Val > N) 


r D,r (p-val: N) 


r dN/dS,D\r 


T 1" /T71/"1T71 1" [771 

insulin/ bG-b signaling |14| 


nvQ 1 n m 1 i \ 
-U. io (U.U1, 11 J 


i n 11 f'l nn 1 1 \ 
+U. ( J (l.UU, 11 ) 


+0.42 


+0.21 (0.71, 11) 


-0.08 (0.40, 11) 


-0.43 


Heregulin/riGl signaling [18] 


ri/ic; / n no 1 n\ 
-U.45 (U.Uo, 1UJ 


i n oo /not; ~\ r\\ 
-f-U.oo (U.OO, 1U; 


-0.07 


+0.14 (0.66, 10) 


-0.05 (0.47, 10) 


-0.23 


INGr jhAjF signaling (smaller J |ly| 


n on ( n no c; "\ 

-U.oy (U.zz, o J 


i n aci ( n co k\ 
+U.by (U.oo, ) 


-0.88 


i n oa 1 r~\ no tr\ 

+0.84 (0.98, 5) 


r\ / ■ f /nil r\ 

-0.66 (0.11, 5) 


-0.73 


NGl /bGl signaling (larger) 1201 


-U.Oo (U.Uo, Yl) 


i n crn /n cia 1 o\ 
+U.oU (U.y4, 12J 


-0.46 


+0.44 (0.92, 12) 


-0.28 (0.19, 12) 


-0.56 


T71*"1T~1 j_ 1 i |ni I 

iiCil receptor endocytosis [21] 


-U.4o (U.U1, zo ) 


i n i n ( n oo oq"\ 
-f-U.iy (U.oz, zo J 


-0.14 


+0.43 (0.98, 23) 


f\ r\f\ 1 1 \ f (\ oo\ 

-0.00 (0.50, 23) 


-0.23 


Wnt/h/KK crosstalk \Z2\ 


n o'v /n r\n *7\ 

-0.87 (0.00, 7J 


-0.3s (0.19, 7) 


+0.00 


i n i fy /n C7 *7\ 

+0.17 (0.67, 7) 


i n oo /n no t\ 

+0.83 (0.98, 7) 


+0.34 


ILo signaling [23J 


n /n no 
-U.Oo ^U.Uz, yj 


n n /n to a\ 
-v.lz yv.oy, y J 


-0.47 


+0.16 (0.72, 9) 


-0.10 (0.45, 9) 


-0.30 


Gl cell cycle progression |24| 


n /n a q i n\ 
-0.0/ (0.43, 1U) 


i n /i ^ /n nn in\ 
+0.46 (0.9U, 10) 


-0.31 


+U.00 (U.yo, 1U; 


r\ ao /n i 1 i n 1 ! 
-U.4o (U.ll, 1UJ 


-0.10 


Til i ■ j* lJ lor - ! 

Rno-kinase activation 25 


-0.09 (0.39, 13) 


+0.09 (0.66, 13) 


-0.15 


+0.45 (0.93, 13) 


+0.01 (0.56, 13) 


-0.18 


Arachidonic acid metabolism |26l 


-0.54 (0.12, 7) 


+0.47 (0.77, 7) 


-0.23 


+0.00 (0.51, 7) 


-0.06 (0.58, 7) 


-0.43 


TNF-driven apoptosis [15j 


-0.50 (0.08, 9) 


-0.04 (0.43, 9) 


-0.39 


+0.49 (0.90, 9) 


-0.28 (0.22, 9) 


-0.22 


/3-adrenergic signaling 27 


-0.66 (0.03, 8) 


-0.44 (0.13, 8) 


+0.42 


+0.09 (0.59, 8) 


+0.17 (0.66, 8) 


+0.57 


expected sign 




+ 




+ 






total of expected sign 


12 


8 


9 


12 


9 


10 



TABLE II: Correlations between evolutionary rate dN/dS, dynamical influence D, and expression-related variables, p- values 
are from one-sided permutation tests, and N is the number of proteins in each correlation 



Discussion 

We have introduced a fine scale measure of protein 
functional importance, based on the influence each pro- 
tein has on the dynamics of the biochemical network in 
which it is embedded. We have shown that in the sys- 
tems we study proteins with greater dynamical influence 
evolve more slowly, although they are no more essen- 
tial as determined by mouse knock-outs. Moreover, the 
correlation of dynamical influence with evolutionary rate 
appears to be independent of expression level and speci- 
ficity. It also appears to be independent of gene com- 
pactness and number of reactions a protein is involved 
in. 

Of the 12 systems tested, 11 show a negative correla- 
tion between protein dynamical influence and evolution- 
ary rate. The one exception, a model of /^-adrenergic sig- 
naling, is a well-mixed approximation to a spatial model 
for which we do not currently have the techniques to an- 
alyze parameter sensitivity, and it was the spatial model 
that was actually fit to biochemical data. The approxi- 
mate model may thus not properly reflect the actual net- 
work dynamics, explaining why this system is an excep- 
tion. This emphasizes the importance of model quality 
in assessing dynamical influence. Wc measured dynam- 
ical influence using computational models because it is 
at present technically very challenging to experimentally 
introduce small controlled perturbations to protein activ- 
ity. All models, however, are approximations to reality, 
which introduces noise into our measure of dynamical 
influence. In particular, the models currently available 
of necessity omit many potential network components 
and interactions. These omissions may explain the many 
cases in which a protein has low measured dynamical in- 
fluence but evolves slowly. (Points clustered near 0,0 in 
Figure [3]) . The rate constant values used in the current 
models are also likely inaccurate. It appears, however, 



that these models nevertheless capture the correct sensi- 
tivity to changes in rate constants. As more complete and 
refined models are developed, we may find that the ob- 
served correlation between dynamical influence and evo- 
lutionary rate grows stronger. 

Many proteins are modular, with different domains 
performing different reactions. Here we measure se- 
quence divergence of whole proteins and define dynam- 
ical influence using all reactions in which a protein is 
involved. Considering the evolution of each protein do- 
main separately and computing its influence using only 
the reactions it is actively involved in may make our 
analysis more powerful. Similarly, we eliminated protein 
complexes from analysis, but future studies may identify 
which components of a complex participate in which reac- 
tions. Both these refinements are difficult at present, be- 
cause models are not typically annotated with the neces- 
sary information. As domain-oriented rules-based mod- 
els |28] grow in prominence, this limitation may disap- 
pear. 

We measure protein dynamical influence by first as- 
sessing how changing each reaction rate constant affects 
network dynamics, and we measure evolutionary rate by 
sequence divergence. However, changes in a protein se- 
quence do not necessarily correspond to changes in rate 
constants [53] • Ideally, we would directly measure the 
evolutionary divergence in rate constants, but at present 
these quantities are difficult to measure in even a sin- 
gle organism. Computational methods that predict the 
impact of particular sequence changes on protein struc- 
ture (e.g. pOj) may provide a more relevant measure of 
evolutionary change. However, given that that we see 
a correlation between dynamical influence and protein 
sequence evolution, we expect that the correlation be- 
tween dynamical influence and more direct measures of 
rate constant evolution would be even stronger. 

A further underlying assumption in our analysis is that 
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the structure of the network remains constant over the 
evolutionary distances considered. A recent study of net- 
work structure evolution in eukaryotes indeed found that 
it is typically well-conserved among vertebrates. The sta- 
bility of the networks we consider is further supported 
by the fact that the correlations we observe are stronger 
when the tree of vertebrates is used rather than the tree 
of mammals. The reduction in noise resulting from using 
more sequence in assessing dN/dS thus appears to out- 
weigh noise from possible changes in network structure 
or function. 

Here we have examined evolutionary rate using bio- 
chemical pathway models. Other authors have also 
taken a pathway perspective evolutionary rate. Ram- 
say et al. showed that proteins acting earlier in partic- 
ular plant metabolic synthesis pathways evolved more 
slowly than those downstream [31j . Similarly, Green- 
berg et al. showed that proteins in the metabolic "core" 
or that share metabolites with other enzymes evolve 
more slowly [35]. Other authors have used flux-balance 
analysis to consider the steady-state properties of the 
yeast metabolic network: Vitkup et al. showed that 
proteins mediating greater metabolic flux evolved more 
slowly [33], while Wang et al. showed that proteins with 
larger knock-out effect on the steady-steady metabolic 
network evolve more slowly [3]. Here we consider not 
only structure or steady-state behavior, but also dynam- 
ics. We also consider a different class of networks, as 
most of our examples are signaling networks, the excep- 
tions being models of the cell cycle and arachidonic acid 
metabolism. 

It is perhaps surprising that in these systems dynam- 
ical influence is independent of knock-out essentiality. 
This may reflect the presence of many reactions which 
must take place for survival, but whose precise rate mat- 
ters little, perhaps due to other forms of regulation. Pro- 
teins involved in such reactions would have dramatic ef- 
fects if knocked out (and thus be deemed essential), but 
the network would be relatively insensitive to changes in 
their sequence (and thus they would have low dynamical 



influence) . 

A great deal of effort has been devoted to teasing apart 
which correlates of evolutionary rate exert independent 
effects. This has proven very difficult, and there is no 
well-established statistical test that may be used. Drum- 
mond et al. have shown that partial correlation may over- 
estimate independence between noisy factors, and they 
argue in favor of principle components regression |34j . 
Principle components regression, however, has similar 
difficulties when variables differ in their noisiness |35j . 
Here we have used partial correlation analysis, as it is 
simple and can be applied to the small number of data 
points we have for each system. Our results are thus 
suggestive of independent effects between dynamical in- 
fluence and other factors, but they are not definitive. 
It does seem, however, that essentiality and dynamical 
influence are themselves almost uncorrelated, implying 
that their effects on evolutionary rate are independent. 

We have introduced a measure of protein functional 
importance, dynamical influence, that assesses the effects 
of small changes in protein reaction efficacy on network 
dynamics. Dynamical influence is negatively correlated 
with evolutionary rate as measured by dN/dS, but it is 
independent of essentiality, a commonly used but much 
coarser measure of importance. By taking a finer view 
of protein importance, our results shed light on the con- 
troversy as to how much functional importance matters 
for evolution, suggesting that more important proteins 
do indeed evolve more slowly. 
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